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[57] ABSTRACT 

An improved hands-free us er-interact ive control and 
d ialing syste m is disclosed for use wltH a speech com- 
munications device. The control system (400) mcludes a 
dynamic noise suppressor (410), a speech recognizer 
(420) for implementing voice-control, a device control- 
ler (430) responsive to the speech recogniz er for con- 
trolUng operating parameters of the speech communica- 
tions device (450) and for producing status information 
rep resenting the operating status of the device, and a 
speech synt hesize r (440) for pr oviding rep ly informa- 
tion to the user as to the speecn commimications device 
op eratin g' status. In a mobile radiotelephone applica- 
tion, the spectral subtraction noise suppressor (414) is 
configured to improve the performance of the speech 
recognizer (424), the voice quaUty of the transmitted 
audio (417), and the audio switching operation of the 
v ehicularspe^ gc phonc (460 ). The combination of noise 
processing, speech recogmtion, and speech synthesis 
provides a substantial improvement to prior art control 
systems. 


48 Claims, 3 Drawing Sheets 
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, from the speech source while at the same time attempt- 

^^^^^fr^?!^2^J^™^ FOR A ing to pick up the same background noise environm«it 

RADIOTELEPHONE as the first microphone, 

HArirrjpoi rvn tut: ixmcxj^xr . ^ ^"^P^^ high-pass filter is often used, perhaps in a 

BACKGROUND OF THE mVENITON 5 microphone preampUfier. to reduce low frequency 

The present mvention relates generally to speech background noise. This may generally be perceived as 

recognition control systems, and more particularly to a ^ improvement in voice quality, but does little to im- 

handspfree telephone control and dialing system espe- prove the speech recognition process. Another ap- 

dally suited for use in. a noisy environment such as proach, that of spectral subtraction noise suppression, 

encountered in a vehicular radiotelephone application. has typically been used as a noise pre-processor to en- 

In both radio and landline telephone systems, the user hance the noise-degraded speech in preparation for 

typically communicates by means of a handset that further processing by a bandwidth compression system 

includes a speaker at one end which is placed close to such as a vocoder. 

the user's ear, and a microphonie at the other end which Although the aforementioned prior art techniques 

IS held close to the user's mouth. In operation, one hand 15 perform adequately under nominal background 

of the user is occupied holding the telephone handset in noise conditions, the performance of these approaches 

Its proper oncntation. thereby leaving the user's only become severely limited when used m specialized appli- 

free lumd to accomphsh tasks such as dnvin^^ cations such as vehicular speakerphones. The more 

In order to provide a greater degree of freedom for the distant microphone delivers a much poorer signal-to- 

user, speakerphones have commonly been used in land- 20 ^^.^^ P ^ «^ J J> 

SSL?rSrs^^^^^ Rei^ntly, ve^^^^ speaker- noise conditions. In rapidly-chLging hi^ noS 

bSl^Sf e^^J^U r Pr W?7Tfi^f h environments, vehicular bSkgrpund noise may 

S^aS'd^^S^ta^^^^^^^ S JT^r^r^^^^^^ r«:og.ition cont«>l sys^ 

assigned to the same Assignee as the present inv;ition, « ^.i^^JT^f T ^"f^.^^^^^^' performance of 

describe vehicular speake^hones with hands-freroi^?: sP^^H'hone audio switching circuitry may be signifi- 

ation. «iuo-iiw wi^ci cantly unpaired m such environments. 

Hands-free coiitrol systems which are responsive to. therefore, exists for an improved hands-free 
human voice are disclosed m a number of U.S. patents. T^"^ V * ^^^^^^ transceiver that pro- 
U.S. Pat. No. 4,520,576 by Vander Molen discloses a 30 ^^5? 5"*"^®°^ background noise attenuation in high 
conversational voice command control system for a ^oise environments, 
home appliance such as a clothes dryer. The control SUMMARY OF THE INVENTION 
system recognizes voice commands and emits synthe- 
sized speech sounds, in an interaction with the user, to . Accordingly, it is a general object of the present 
obtain the information necessary for setting the operat- 33 invention to provide an unproved method and appara- 
ing parameters. Speech recognition and speech synthe- controlling a speech communications device in a 
sis have also been applied to radio transceiver control ^^^^ environment. 

functions (on/off, transmit/receive, volume and ^ particular object of the present invention is to 

squelch control, etc.) in U.S. Pat. No. 4,426,733 by provide an improved hands-free user-interactive con- 

Brenig. Additionally, U.S. Pat. No. 4,348,550 by Pirz et 40 ^^^^ ^ dialing system for a mobile radiotelephone, 
al, discloses a repertory dialing circuit for a telephone ^ ftirther object of the present invention is to im- 

system which is controlled by the user's spoken word. P^o^^ performance of the radiotelephone's speech 

However, the application of hands-free control to a recognition control system, the voice quality of the 

vehicular speech communications system, such as a transmitted audio, and the audio switching operation of 
mobile radiotelephone, introduces several significant 45 vehicular speakerphone. 

obstacles. When speech recognition is utilized in a ve- In accordance with the present invention, an ira- 
hicular environment, the high degree of ambient noise proved user-interactive control system for a speech 
inherent in a vehicle presents a considerable problem to communications device is provided such that the user's 
reliable voice control. Furthermore, a vehicular speak- hands are free to accomplish other tasks. The control 
erphone typically has a microphone that is distant from 50 system of the present invention includes a means for 
the user's mouth, such as being mounted overhead on dynamically suppressing background noise from an 
the automobile sun visor. Consequently, . the required i°put speech signal; a means responsive to the noise 
high microphone sensitivity causes a large increase in suppression means for recognizing user-spoken com- 
the amount of environmental background noise being mand words; a means responsive to the speech recogni- 
applied to the speech recognizer, as well as being trans- 55 tion means for controlling operating parameters of the 
mitted to the landline party. speech communications device and for producing status 
Numerous approaches to this noisy speech problem information representing the operating status of the 
have been attempted, with only limited success. For device; and a means responsive to such status informa- 
example, it is well known that speech may be enhanced tion for providing an indication to the user as to the 
in an aircraft through the use of a separate microphone^ 60 speech communications device operating status, 
located at a distance away from the user's first micro- In the preferred embodiment, the hands-free user- 
phone, such that it picks up only background noise. The interactive control system is used with a mobUe radio- 
general characteristics of the background noise can then telephone employing a vehicular speakerphone. User- 
be removed by subtracting an estimate of the back- spoken input speech is first acoustically coupled" to the 
ground noise from the desired signal. This technique has 65 control system, then noise-processed by a spectral sub- 
been shown to provide a limited improvement in signal- traction noise suppressor. The noise-processed speech 
to-noise ratio (SNR). However, it is very difficult to information is then applied to a speech recognizer 
achieve the required isolation of the second microphone which provides operating parameter control signals 
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oorresponding to predetermined user-spoken command 
words. A radio-interfacing control unit utilizes these 
control signals to dial telephone numbers spoken by the 
user or recalled from a stored telephone number direc* 
tory in response to a corresponding command word, to 5 
store and recall telephone numbers from this directory^ 
and to control radio ftuictlonal operation. The control 
unit also provides status information to a speech synthe- 
sizer which provides audible feedback to the user as to 
the present operating status of the radiotelephone. Fur- 10 
thennore, noise-suppressed speech is used by the vehic- 
ular speakerphone to improve its switching perfor- 
mance, and used by the radio transmitter to improve the 
quality of the transmitted speech. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The features of the present invention which are be- 
lieved to be novel are set forth with particularity in the 
appended claims. The invention itself, however, to- 
gether with further objects and advantages thereof, may 20 
best be understood by reference to the following de- 
scription when taken in conjunctioii with the accompa- 
nying drawings, in which: 

FIG. 1 is a general block diagram of a speech commu- 
nications device control system according to the pres- 25 
ent invention; 

FIG. 2 is a block diagram of the control system of the 
present invention applied to a speech ccnmnunications 
terminal; 

, FIG. 3 is a block diagram of a speech communica- 30 
tions terminal control system according to the present 

.invention employing a hands-free speakerphone; and 

' FIG. 4 is a detailed block diagram of an embodiment 
of the present invention incorporating a mobile radio- 
telephone hands-tree control system with a vehicular 33 

.speakerphone. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENT 

Referring now to the accompanying drawings, FIG. 40 
/,! shows a general block diagram of user-interactive 
^control system 100 of the present invention. Speech 
!communicadons device 150 may include portions of any 
radio or landline voice communications system, such as, 
for example, 2-way radio systems, telephone systems, 45 
intercom systems, etc. User-spoken input speech is ap- 
plied to microphone 105, which acts as an acoustic 
coupler providhig an electrical input speech signal for 
the control system. Noise processor 110 performs dy- 
namic noise suppression upon the input speech signal to SO 
provide noise suppression information to speech recog- 
nizer 120. Dynamic noise suppression, as used herein, 
refers to the process of adaptively filtering quasi-sta- 
tionary background noise (i.e., noise exhibiting a rela- 
tively constant long-term power spectrum) from the 55 
desired signal. An example of dynamic noise suppres- 
sion is the spectral subtraction or spectral gain modifica- 
tion technique known in the art The noise suppression 
information may be comprised of either noise-sup- 
pressed speech itself, spectxul subtraction noise suppres- 60 
sion parameters to be used In the speech recognizer, or 
both. A further description of noise processor 110, as 
well as the spectral subtraction/spectral gain modifica- 
tion technique, may be found in the description of noise 
processor 410 of FIG. 4. 65 

Speech recognizer 120 utilizes this noise suppression 
information by either directly performing speech recog- 
niuon upon noise-suppressed speech, or by utilizing 
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noise suppression parameters in the speech recognition 
process. Hence, much more accurate speech recogni- 
tion performance is achieved with knowledge of the 
noise content of the speech signal. A further discussion 
of an appropriate speech recognition apparatus, and 
how the preferred embodiment incorporates noise sup- 
pression data into the speech recognizer, may be found 
in the description accompanying FIG. 4. 

Device controller 130 interfaces the control system to 
speech communications device 150. Device controller 
130 translates device control data provided by speech 
recognizer 120 into control signals that can be recog- 
nized by the particular speech communications device. 
These control signals direct the device to perform spe- 
ciiic operating functions as instructed by the user. A 
example of a device controller known in the art and 
suitable for use with the present invention is a micro- 
processor. 

Device controller 130 also provides device status 
data representing the operating status of speech com- 
munications device 150. This data is applied to speech 
synthesizer 140, and translated into user-recognizable 
speech when output via speaker 145. As will be appar- 
ent to those skilled in the art, other means to provide an 
indication to the user as to the speech communications 
device operating status may be utilized. Such indication 
may include a visible display (LED, LCD, CRT, etc.) 
or a sound transducer (tone generator or other audible 
signal). Thus, FIG. 1 illustrates how the present inven- 
tion provides a user-interactive control system utilizing 
noise suppression, speech recognition, and speech syn- 
thesis to control the operating parameters of a speech 
communications device. 

FIG. 2 illustrates the application of the user-interac- 
tive control system to a speech communications termi- 
nal, such as, for example, a telephone terminal, a com- 
munications console, a 2-way radio, etc. Noise proces- 
sor 210, speech recognizer 220, terminal controller 230, 
and speech synthesizer 240, are the same in structure 
and operation as the corresponding blocks of FIG. 1. 
However, control system 200 further illustrates the 
internal structure of speech communications terminal 
250. In this embodiment, microphone 205 and speaker 
295 are incorporated into the speech communications 
terminal itself. A typical example of this microphone/- 
speaker arrangement would be a telephone handset. 
Speech communications terminal 250 also has a trans- 
mitter block 260 coupled to a transmit path 265, a re- 
ceive block 280 coupled to a receive path 2S5, and a 
terminal logic block 270 for controlling both the trans- 
mitter and receiver blocks. Terminal logic block 270 
typically has access to the operating status information 
of speech communications terminal 250, and interfaces 
this information to terminal controller 230 via terminal 
interface path 235. 

The example of a "smart" telephone terminal em- 
ploying voice-controlled dialing from a stored tele- 
phone number directory is now used to describe the 
operation of the control system of the present invention. 
Initially, the user speaks a verbal command into micro- 
phone 205, such as the conmiand word "recall". The 
utterance is first noise-processed by noise processor 210, 
then recognized as a valid user command by speech 
recognizer 220. In this example, terminal controller 230 
then directs speech synthesizer 240 to generate the ver- 
bal reply "name?" via speech synthesis output line 245 
through multiplexer 290 to speaker 295. (For details of 
multiplexer 290, refer to the description of multiplexer 
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470 of FIG. 4.) The user then responds by speaking a trol system 300 may be realized by using noise-sup- 
word such as "office"— a name in the directory index pressed speech for the ^eakerphone audio switching, 
corresponding to a telephone number that he desires to In a high noise, environment, this technique provides a 
dial. The word will be recognized as a valid command significant performance increase to simplex vehicuhir 
word if it corresponds to a predetermined name index 5 speakerphones. Thus, the noise processing block then 
stored in the terminal controller telephone number di- performs three functions: improving speech recognition 
rectory. If valid, controller 230 directs speech synthe- performance; improving transmitted voice quality; and 
azcr 240 to reply "office" thereby confirming the rec- improving speakerphone audio switching, 
og^cd command word. FIG. 4 is a detailed block diagram of the hands-ftee 

The user then says the command word "send", which 10 control system of the present invention. In general, the 
when recognized by the control system, instructs termi- control system arrangement is the same as that of FIG. 
nal controller 230 to obtain die telephone number corre-.. 3, with the above-mentioned exception that the input 
spending to the name "office" and send telephone num- speech signal from the microphone is first noise-proc- 
bcr dialing information to terminal logic block 270 via essed before being applied to the speakerphone. Micro- 
tenmnal interface path 235. Terminal logic block 270 15 phone 402, which is typically remotely-mounted at a 
outputs this dialing information along transmit path 26S distance from the user's mouth (i.e., on the automobile 
via transmitter 260. When the telephone connection is sun visor), acoustically couples the user's voice to con- 
made, terminal receiver 280 provides audio from re- trol system 400. This speech signal is generally ampli- 
ceivc path 285 to speaker 295 via multiplexer 290. If a fied by preamplifier 404 to provide input speech signal 
proper telephone connectioii cannot bie made, terminal 20 405. 

controller 230 reads the status of terminal logic block Noise processor block 410 first converts the analog 
270 and generates status information, such as the reply input speech signal to digital form at analog-to-digital 
word "busy", to be output to the user via speech synthe- converter 412. This digital data is then applied to spec- 
sizer 240. In this manner, user-interactive voice-con- tral subtraction noise suppressor 414^ which performs 
trolled directory dialing is achieved, 25 the actual dynamic noise suppression function. Any 

In addition to . noise-processing operational com- dynamic noise suppression implementation may be uti- 
mands, the user's speech is also noise-processed before it lized in block 414, however, the present embodiment 
is coupled to transmit path 265 via transmit audio line utilizes a particular form of spectral subtraction noise 
215. Hence, noise processor 210 provides noise suppres- suppression— the channel filter-bank technique. Under 
sion information for tiie speech recognizer as well as a 30 this approach, the audio input signal spectrum is divided 
noise-suppressed speech signal for the transmitter audio. into individual spectral bands by a bank of bandpass 
Accordingly, the performance of the control system's filters, and particular spectral bands are attenuated ac- 
speech recognition process as well as the quality of the cordmg to their noise energy content 
transmitted audio signal are substantially unproved. The value of the attenuation is dependent upon the 

Although speech recognition and speech synthesis 35 signal-to-noise ratio (SNR) of the detected signal. The 
allow a vehicle operator to keep both eyes on die road, SNR is calculated from a background noise estimate for 
a conventional handset or hand-held microphone pro- that channel and a channel energy estimate. When voice 
hibits him firom keeping both hands on the steering is present in an individual channel, the channel signal- 
wheel or from executing proper manual (or automatic) to-noise ratio will be high. Thus, the noise suppressor 
transmission shifting. For this reason, the control sys- 40 increases die gain for that particular channel. The 
tem of FIG. 3 incorporates a speakerphone to provide amount of the gain rise is a function of the estimated 
hands-free control ofthe speech communications termi- SNR— the greater the SNR, the more the individual 
nal. The speakerphone performs the transmit/receive channel gain will be raised from the base gain (all noise), 
audio switching function, as well as the received/reply If only noise is present in the individual channel, the 
audio multiplexing function. 45 SNR wfll be low. and die gain for that channel wiU be 

Referring now to FIG. 3, control system 300 utilizes reduced to the base gain. Since voice energy does not 
the same noise processor block 310, speech recognizer ^pcar in all of the channels at the same time, the chan- 
block 320, terminal controller block 330, speech synthe- ncls containing a low voice energy level (mostly back- 
sizer block 340 and speech communications terminal ground noise) will be suppressed (subtracted) from the 
350 as the corresponding blocks of FIG. 2. However. 50 voice, energy spectrum. A spectral subtraction nbise 
microphone 305 and speaker 375 are not an integral part suppression prefiltcr of this type is described in R. J. 
ofthe terminal 350. Instead, speakerphone 360 directs McAulay and M. L. Malpass, "Speech Enhancement 
input speech signal from microphone 305 to noise pro- Using a Soft-Decision Noise Suppression Filter," IEEE 
cesser 310 via input signal line 365. This mputsignal Ime Trans. Acoust, Speech; Signal Pmessing, vol. ASSP-28, 
may be switched in the case of a simplex speakerphone, 55 no. 2, (April 1980). pp. 137-145. 
or may be directly coupled in the case of a duplex Noise suppressor 414 provides noise suppression data 
speakerphone. Speakerphone 360 also controls the mul- 418 for use by speech recognition block 420. This noise 
tiplexing of speech reply line 345 and receive audio line suppression data may consist of actual noise-suppressed 
355 to speaker 375. A more detailed description of the speech, or alternatively may represent spectral subtrac- 
switching/multiplexing configuration of the spieaker- 60 tion noise suppression parameters to be incorporated 
phone IS described later in conjunction with FIG. 4. . into the speech recognition algorithm. In the first case. 

Hence, FIG. 3 illustrates the application of the pres-. speech recognizer 424 would be performing speech 
ait mvention control system to a speech conomunica- recognition upon noise-suppressed speech itself In the 
tions terminal employing a speakerphone to free the latter case, speech recognizer 424 would simply utilize 
user's hands. In the preferred embodiment, spectral 65 the noise suppression data to compensate for the back- 
subtraction noise suppression is utilized to process the ground noise in die speech recognidon process. In the 
mput speech for speech recognition as well as for the present embodiment, this noise suppression data in- 
transmitted audio path. A/urther improvement to con- eludes channel filter-bank data (signal information), a 
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per-channel background noise estimate of the input 
speech signal (noise information), and noise-processed 
signal energy wtUi the current background noise energy 
level (woid boundary information). This signal, noise,, 
and word boundary information is utilized during S 
speech recognition to adjust the word*matching process 
to compensate for high background noise levels. Other 
background noise compensation algorithms for speech 
recognition may also be used, such as those described in 
the article by J. Peckham, J. Green. J. Canning, and P. 10 
Stevens, entitled "A Real-Time Hardware Continuous 
Speech Recognition System," IEEE International Con- 
ference on Acoustics, Speech, and Signal Froc&sing, May 
3-5 1982, vol. 2, pp. 863>866, and the references con- 
tained therein. In either case, noise processing results in 15 
a considerable improvement in speech recognition per- 
formance. 

In the present embodiment, an 8-bit microcomputer 
performs the ftinction of speech recognizer 424, and an 
EEPROM functions as template memory 422. More- 20 
over, several other control system blocks of FIG. 4 are 
implemented in part by the same microcomputer with 
the aid of a CODEC/FILTER and a DSP (digital sig- 
nal processor). The above referenced article describes 
still another microprocessor architecture. Hence, the 25 
.present invention is not limited to any specific hardware 
or any specific type of speech recognition. More partic- 
ularly, the present invention contemplates the use of: 
speaker dependent or speaker independent speech rec- 
ognition; isolated or continuous word recognition; and 30 
a software-based or hardware-based implementation. 

Template memory 422 stores word templates to be 
matchai to the incoming speech in speech recognizer 
424. During training, speech recognizer 424 is in- 
structed by the control unit to send word templates to 35 
template memory 422 via memory bus 426. During 
recognition, speech recognizer 424 compares the previ- 
ously stored templates from memory 422 against noise- 
processed speech information. The recognition algo- 
rithm of the present embodiment incorporates near-con- 40 
tinuous speech recognition, dynamic time warping, 
energy normalization, and a Chebyshev distance metric 
to determine a template match. Prior art recognition 
algorithms, such as described in J. S. Bridle, M. D. 
Brown, and R. M. Chamberlain, '*An Algorithm for 45 
Connected Word Recognition,'* IEEE International 
Conference on Acoustics, Speech and Signal Processing, 
May 3-5 1982, vol. 2, pp. 899-902, may also be used. On 
the whole, speech recognition block 420 utilizes back- 
ground noise information from noise processor block 50 
410 to increase speech recognizer performance in a high 
background noise environment. 

Controller block 430, consisting of control unit 434 
and directory memory 432, serves to interface speech 
recognition block 420 and speech synthesis block 440 to 55 
radiotelephone 450 via interface busses 428, 438 and 458. 
respectively. Control unit 434 is typically a controlling 
microprocessor which is capable of interfacing data 
from radio logic 452 to the other blocks. Control unit 
434 also performs operational control of radiotelephone 60 
450, such as: unlocking the control head; placing a tele- 
phone call; ending a telephone call; etc. Depending on 
the particular hardware interface structure to the radio, 
control unit 434 may incorporate other sub-blocks to 
perform specific control functions as DTMF dialing, 65 
interface bus multiplexing, and control-function deci- 
sion-making. Directory memory 432, an EEPROM, 
stores the plurality of telephone numbers and names. 


thereby permitting directory dialing. Memory bus 436 
sends information to directory memory 432 during the 
process of entering telephone names and numbers, and 
provides this stored directory mformation to control 
unit 434 in response to the recognition of a valid direc- 
tory dialmg command Depending on the particular 
telephone device used, it may be more economical to 
mcorporate durectory memory 432 into the telephone 
device itself. In general, however, controller block 430 
performs the telephone directory storage function, the 
telephone number dialing function, and the radio opera- 
tional control function. 

Controller hlock 430 also provides status mformation 
representing the operating status of the radiotelephone. 
This status information may include information as to 
the names and telephone numbers stored in directory 
memory 432 ("Office", "555-1234", etc.), directory 
status information ("Directory Full", "Name?", etc.), 
speech recognition status information ("Ready", "User 
Number?", etc.), or radiotelephone status information 
("Call Dropped", "System Busy", etc.). Hence, control- 
ler block 430 is the heart of the user-interactive speech 
recognition/speech reply control system of the radio. 

Speech synthesis block 440 performs the voice reply 
ftmction. Voice reply data is fed to channel bank speech 
synthesizer 444 via interface bus 438, Using this infor- 
mation, speech synthesizer 444 recalls reply words from 
reply memory 442, synthesizes these reply words, and 
outputs them to digital-to-analog converter 446. The 
voice reply is then routed to the user. In the present 
embodiment, channel bank speech synthesizer 444 is the 
speech synthesis portion of a 19-channel vocoder. An 
example of such a vocoder may be found in J, N. 
Holmes, *The JSRU Channel Vocoder", lEE PROC. 
vol. 127, pt. F, no. 1, (February, 1980), pp. 53-60. The 
information provided by supply memory 442 may also 
include whether the input speech frame should be 
voiced or unvoiced, the pitch rate if any, and the gain of 
each of the 19 filters. However, as will be obvious to 
those skilled in the art, any speech synthesis apparatus 
may be utilized. Furthermore, the present invention 
contemplates that any means of providing a reply to the 
user would perform the basic reply function of speech 
synthesizer block 440. For example, a visual indication 
(such as a display light) or an audible indication (such as 
a reply tone) may be substituted. 

As we have seen, the present invention teaches the 
implementation of noise suppression with speech recog- 
nition and speech synthesis to provide a user-interactive 
control system for a speech communications device. In 
the present embodiment, the speech communications 
device is a radio transceiver, such as a cellular mobile 
radiotelephone. However, any speech communications 
device warranting hands-free user-interactive operation 
in a noisy environment may be used. For example, any 
simplex radio transceiver requiring hands-free control 
may also take advantage of the improved control sys- 
tem of the present invention. 

Referring now to radiotelephone block 450 of FIG. 4, 
radio logic 452 performs the actual radio operational 
control function. Specifically, it directs frequency syn- 
thesizer 456 to provide channel information to transmit- 
ter 453 and receiver 457. The function of frequency 
synthesizer 456 may also be performed by crystal-con- 
trolled channel oscillators. Duplexer 454 interfaces 
transmitter 453 and receiver 457 to a radio frequency 
(RF) channel via antenna 459. In the case of a simplex 
radio transceiver, the function of duplexer 454 may be 
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performed by an RF switch. For a more detailed expla- 
nation of representative radio transceiver circuitry, 
refer to Motorola Instruction Manual 68P81066E40 
entitled "DYNA T.A.C Cellular MobUe Telephone." 

Speakerphone 460, also termed a VSP (vehicular 5 
speakerphone) in the present apptication, provides: 
hands-free acoustic coupling of the user-spoken audio 
to the control system; the synthesized speech reply 
signal to the user; and the received audio from the ra- 
diotelephone to the user. As previously mentioned, 10 
noise processor block 410 performs spectral subtraction 
noise suppression upon the input speech signal 405, to 
produce noise suppression information for speech rec- 
ognition. This information is also used by digital-to- 
analog converter 416 which produces noise-suppressed \5 
microphone audio 415. The noise-suppressed speech 
signal is appUed to VSP transmit audio switch 462, 
which routes noise-suppressed microphone audio 415 to 
radio transmitter 453 via transmit audio 417. VSP trans- 
mit switch 462 is controlled by VSP signal detector 464. 20 
Signal detecter 44$4 compares microphone audio 415 
against receive audio 455 to perform the VSP switching 
function. 

When the mobile radio user is talking, signial detector 

464 provides a positive control signal via detector out- 25 
put 461 to close transmit audio switch 462, and a nega- 
tive.control signal via detector output 463 to open re- 
ceive audio switch 468. Conversely, when the landline 
party is talking, signal detector 464 provides the oppo- 
site polarity signals to close receive audio switch 468, 30 
while opening transmit audio switch 462. When the 
receive audio switch is closed, receiver audio 455 from 
radiotelephone receiver 457 b routed through receive 
audio switch 468 to multiplexer 470 via switched re- 
ceive audio output 467. In some communications sys- 35 
terns, it may prove advantageous to replace audio 
switches 462 and 468 with variable gain devices that 
provide equal but opposite attenuations in response to 
the control signals from the signal detector. In either 
case, the use of noise-suppressed microphone audio 415, 40 
as opposed to input speech signal 405, assists signal 
detector 464 in makmg accurate audio path control 
decisions. Hence, the noise suppression/speakerphone 
configuration of FIG. 4 significantly improves the noise 
falsing and desensitization performance of a vehicular 45 
speakerphone. 

Multiplexer 470 switches between voice reply audio 
445 and switched receive audio 467 in response to multi- 
plex signal 435 from control unit 434. Whenever the 
control unit sends stams information to the speech syn- 50 
thesizer, multiplexer signal 435 directs multiplexer 470 
to route the voice reply audio to the speaker. VSP audio 

465 is usually amplified by audio amplifier 472 before 
being applied to speaker 475. The vehicle speakerphone 
embodiment described herein is only one of numerous 55 
possible configurations. It should be emphasized, how- 
ever, that the present invention teaches the technkjue of 
utilizing noise-suppressed microphone audio for the 
VSP audio switching. This technique provides a nota- 
ble improvement in speakerphone performance. 60 

In summary, FIG. 4 iUustrates a radiotelephone hav- 
ing a handsp-free user-interactive speech-recognizing 
control system for controlling radiotelephone operating 
parameters upon a user-spoken command. The control 
system provides audible feedback to the user via speech 65 
synthesis as to the radiotelephone operating status. The 
vehicle ^eakerphone provides hands-free acoustic cou- 
plhig of the user-spoken input speech to the control 
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system, the speech reply signal from the control system 
to the user, and the receiver audio to the user. The 
implementation of noise processmg to the control sys- 
tem improves the performance of the radiotelephone's 
speech recognition, the voice quality of the transmitted 
audio, and the audio switching operation of the vehicu- 
lar speakerphone. The combination of noise suppres- 
sion, speech recognition, and speech synthesis provides 
a substantial improvement to prior art control systems. 

While specific embodunents of the present invention 
have been shown and described herein, further modifi> 
cations and improvements may be made by those skilled 
in the art. All such modifications which retain the basic 
underlying principles disclosed and claimed herein are 
within the scope of this mvention. 

What is claimed is: 

1. A speech communications device having a transmit 
path, a receive path, and a user-interactive control sys- 
tem for controlling device operating parameters, said 
control system comprising: 

means for dynamically suppressing background noise 
from an input speech signal to produce noise sup- 
pression information; 

means responsive to said noise suppression informa- 
tion for recognizing user-spoken command words 
to produce device control data; 

means responsive to said control data for controlling 
said device operating parameters, and for produc- 
ing device status data representing the operating 
status of said speech communications device; and 

means responsive to said status data for providing an 
indication to the user as to the speech communica- 
tions device operating status. 

2. The speech communications device according to 
claim 1, wherein said background noise suppressing 
means includes: 

means for generating a noise-suppressed speech signal 
in response to said noise suppression information; 
and 

means for coupling said noise-suppressed speech sig- 
nal to said speech communications device transmit 
path. 

3. The speech communications device according to 
claim 1, further comprising means for placing a tele- 
phone call by voice command, the placement of which 
constitutes one of said operating parameters. 

4. The speech communications device according to 
chum 3, wherem said controlluig means mcludes: 

directory means for storing a plurality of telephone 
numbers; and 

dialing means for dialing telephone numbers obtained 
from said directory means in response to recogni- 
tion of a predetermined verbal command. 

5. The speech communications device according to 
claiin 4, further comprising means for storing a tele- 
phone number in said directory means by voice com- 
mand, the storage of which constitutes one of said oper- 
ating parameters. 

6. The speech conmiunications device according to 
claim 1, wherein said indication providing means is a 
speech synthesizer which synthesizes a verbal reply 
based upon said device status data. 

7. The speech communications device according to 
claim 1, wherein said background noise suppressing 
means uses the spectral gain modification noise suppres- 
sion technique. 

8. The speech communications device according to 
claim 1, ftirthcr comprising radio frequency (RF) com- 
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mtthication means for transmitting information carried 
by said transmit path and for receiving information and 
for coupling said received information to said receive 
path. 

9. A radio communications apparatus having a trans- 
mitter, a receiver, and a speech-recognizing control 
system, said control system comprising: 

means for dynamically suppressing background noise 
from an input speech signal to produce noise sup- 
pression information; 

means responsive to said noise suppression informa- 
tion for recognizing user-spoken command words 

. to produce voice command data; 

means responsive to said voice command data for 
controlling operating functions of said radio com- 
munications apparatus, and for producing voice 
reply data indicative of the operating status of said 
radio communications apparatus; and 

means for synthesizing a speech reply signal from said 
voice reply data, thereby providing an audible 
indication to the user as to the radio communica- 
tions apparatus operating status. 

10. The radio communications apparatus accordmg 
to claim 9, wherein said radio communications appara- 
tus is a mobile radiotelephone. 

11. The radio communications apparatus according 
to claun 9, wherein said background noise suppressing 
means includes: 

means for generating a noise-suppressed speech signal 
in response to said noise suppression information; 
and 

means for coupling said noise-suppressed speech sig- 
nal to said radio communications apparatus trans- 
mitter. 

12. The radio communications apparatus according 
to claim 10, further comprising means for placing a 
telephone call by voice command, the placement of 
which constitutes one of said operating functions. 

13. The radio communications apparatus according 40 
. to claim 12, wherein said controlling means includes: 

directory means for storing a plurality of telephone 
numbers; and 

dialing means for diating telephone numbers obtained 
from said directory means in response to recogni- 
tion of a predetermined verbal command. 

14. The radio communications apparatus according 
to claim 13, further comprising means for storing a 
telephone number in said directory means by voice 
command, the storage of which constitutes one of said 50 
operating functions. 

15. The radio communications s^iparatus according 
to claim 9, wherein said background noise suppressing 
means uses the spectral gain modification noise suppres- 
sion technique. 

16. The radio communications apparatus according 
to claim 9, wherein said noise suppression information 
includes channel-bank information and at least one 
background noise estimate of said input speech signal. 

17. The radio communications apparatus according 
to claim 9, further comprising means for hands-free 
acoustic coupling said user-spoken command words to 
said input speech signal, said speech reply signal to the 
user, and an audio signal from said radio receiver to the 
user. 

18. The radio communications apparatus according 
to claim 17, wherein said hands-free acoustic coupling 
means is a speakerphone. 


19. A radio transceiver having a transmitter, a re- 
ceiver, and a hands-free user control means for control- 
ling a plurality of user-controlled radio transceiver op- 
erating parameters upon a user-spoken command word, 
and for providing audible feedback to the user as to the 
radio transceiver operating status, said control means 
comprising: 

first coupling means for providing hands-free acous- 
tic couplmg of user-spoken input speech to said 
control means, thereby providing an mput speech 

signal; 

means for dynamically suppressing background noise 
from said input speechTsignal by spectral gain mod- 
ification, thereby providing noise suppression data; 
means respon^ve to said noise suppression data for 
recognizing a plurality of predetermined user- 
spoken command words to provide voice com- 
mand data, said plurality of predetermined com- 
mand words corresponding to said plurality of 
radio transceiver operatmg parameters; 
means responsive to said voice command data for 
controllmg said radio transceiver operating param- 
eters, and for producing radio status data indicative 
of the present operating status of said radio trans- 
ceiver, 

means for synthesizing a speech reply signal from said 

radio status data; and 
second coupling means for providing hands-free 
acoustic coupling of said speech reply signal from 
said control means to said user, thereby providing 
audible feedback to the user as to said radio trans-* 
ceiver present operating status. 

20. The radio transceiver according to claim 19, 
wherein said radio transceiver is a mobile radiotele- 
phone. 

21. The radio transceiver according to claim 19, 
wherein said background noise suppressing means in- 
cludes: 

means for generating a noise-suppressed speech signal 

in response to said noise suppression data; and 
means for coupling said noise-suppressed speech sig- 
nal to said radio transmitter. 

22. The radio transceiver according to claim 20, fur- 
ther comprismg means for placing a telephone call by 
voice command, the placement of which constitutes one 
of said plurality of operating parameters. 

. 23. The radio transceiver according to claim 22, 
wherein said controlling means includes: 
directory means for storing a plurality of telephone 

numbers; and 
dialmg means for dialing telephone numbers obtained 
from said directory means in response to recogni- 
tion of a predetermined verbal command. 

24. The radio transceiver according to claim 23, fur- 
ther comprising means for storing a telephone number 
hi said directory means by voice command, the storage 
of which constitutes one of said plurality of operatmg 
parameters. 

25. The radio transceiver according to claim 19, 
wherein said noise suppression data includes channel- 
bank mformation and at least one background noise 
estimate of Sfaid input speech signal. 

26. The radio transceiver according to claim 19, fur- 
ther comprising third coupling means for providing 
hands-free acoustic coupling of a received signal from 
said radio receiver to said user. 
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27. The radio transceiver according to claim 26, 38. The method according to claim 37, wherein said 
wherein said first, second, and third coupling means acoustic coupling steps are performed by a speaker- 
comprise a speakerphone. phone. 

28. The radio transceiver according to claim 19, ^P* method of controlling a radio transceiver 
wherein said radio transceiver operating status includes ^ ^ving a transmit audio path, a receive audio path, a 
information as to the names and telephone numbers Plurality of operating parameters, and a plurality of 
stored in said directory means, said status information °V^^S states, by speech recognition and speech re- 
being user-accessible by voice command. P^^' comprising the steps of: 

29. Tlie method of controlling a radio communica- ^"^^^.^tl'^ 

tions apparatus having a transmitter, a receiver, and a t^hV^^^Hi^^^ ^? u"^"^ transceiver, 

^ech.e^^^^^^ control system, comprising the dySSu^TpS^^^^^^^^^ 

TJ««.j«-n • , -I . , "^P"^ ^^Snal by spectral gain modification, 

dyiiamically suppressmg background noise from an thereby providing noise suppression data; 
input speech signal to produce noise suppression 15 recognizing a plurality of predetermined verbal com- 

infornaation; mand words in response to said noise suppression 

recognizing user-spoken command words hi response data to provide voice command data, said plurality 

to said noise suppression information to produce of predetermined verbal command words corre- 

voice command data; spending to said plurality of operating parameters; 

controlling operating ftmctions of said radio commu- ^0 controlling said radio transceiver operating parame- 

nications apparatus in response to said voice com- response to said voice command data, and 

mand data, and producing voice reply data indica- producing radio status data indicative of the pres- 

tive of the operating status of said radio communi- operating state of said radio transceiver; 

cations apparatus; and synthesmng a speech reply signal from said radio 

synthesizing a spewh reply signal from said voice «,^^!if„fl^^ v r 

reply data, thereby providing an audible mdication P'^^^f "^S hands-free acoustic couplmg of said speech 

f« *K» ^u/^M^ uiuiwttuun j ^^^^ ^^^^ ^^^^ transce ver to said user. 

to the user as to the radio communications appara- fuLlu.. j. j-i^i r ju • % "^".^"^ 

tus operating status. ^^"^^^J producmg audible feedback of said present 

i«u|i|c«uii,5 3uiiu». ^ . . ^ . operating state of said radio transceiver. 

30. The method accordmg to claim 29, wherem said 30 40. The method according to claim 39, wherein said 
radio coramumcaUons apparatus is a mobile radiotele- radio transceiver is a mobile radiotelephone. 
P^?"^^ ^® nicthod according to claun 39, further com- 

31. The method according to claim 29, further com- prising the steps of: 

prising the steps of; generating a noise-suppressed speech signal h re- 
generating a noise-suppressed speech signal in re- 35 sponse to said noise suppression dau; and 

sponse to said noise suppression information; and coupling said noise-suppr^sed speech signal to said 

coupling said noise-suppressed speech signal to said radio transmitter. 

radio communications apparatus transmitter. ^® method according to claim 40, wherein said 

32. The method according to claim 30, wherem said plurality of operating parameters includes placing a 
operating functions include placing a telephone call by ^ telephone call by voice command. 

voice command. ^* method according to claim 42, further com- 

33. The method according to claim 32, further com- ^"^j!!^^^^ ?^ ^ 

prising the steps of- storing a plurality of telephone numbers m a stored 

tory in response to recognition of a predetermined 44. The method according to claim 43, wherein said 

vwtal comm^d. pluraUty of operating parameters further includes stor- 

34. The method accordmg to claim 33. wherein said 50 ing a telephone number in said directory by voice com- 
operating functions further include storing a telephone mand. 

number in said directory by voice command, 45. The method according to claim 39, wherein said 

35. The method according to claim 29, wherein said noise suppression data includes channel-bank informa- 
background noise suppressing step uses the spectral tion, and at least one background noise estimate of said 
gain modification noise suppression technique. ^^P^^ speech signal. 

36. The method according to claim 29, wherein said ^ ^® method according to claim 39, further com- 
noisc suppression information includes channel-bank Posing the step of providing hands-free acoustic cou- 
information and at least one background noise estimate P^S of a received signal frpm said receive audio path 
ofsaid input speech signal ^ ^^H^J^^' ^ 

37. The metiiod according to claim 29, further com- ^ according to claim 46, wherein said 
prising the steps of: acoustic couphng steps are performed by a speaker- 

'"^nJiif/n T^^^^f said user-spoken command ^ ^"'^^ ^^^^ ^j^^ 

wo ds to said input speech signal; ^^io transceiver opera^g status inc ud^Tfoma- 

acoustically couphng said speech reply signal to the 65 tion as to the names aid telephone numbers stor^Si 

user, and . . , telephone number directory, said status informa- 

acousticaUy .coupImg an audio signal from said radio tion being user-accessible by voice command, 

receiver to the user. * • • « « 
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